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Abstract — A single-letter characterization is provided for 
the capacity region of finite-state multiple-access channels, 
when the channel state process is an independent and iden- 
tically distributed sequence, the transmitters have access to 
partial (quantized) state information, and complete channel 
state information is available at the receiver. The partial 
channel state information is assumed to be asymmetric 
at the encoders. As a main contribution, a tight converse 
coding theorem is presented. The difficulties associated with 
the case when the channel state has memory are discussed 
and connections to decentralized stochastic control theory 
are presented. 

Keywords: multiple-access channel, asymmetric chan- 
nel state information, decentralized stochastic control, 
non-nested information structure. 

I. Introduction and Literature Review 

Wireless communication channels and Internet type 
networks are examples of channels where the channel 
characteristics are time-varying. In wireless channels, 
the mobility of users and changes in landscape as well 
as interference may lead to temporal variations in the 
channel quality. In network applications, user demand 
and node failure may affect the channel reliability. Such 
channel variation models may include fast fading and 
slow fading; in fast fading, the channel state is assumed 
to be changing for each use of the channel. On the 
other hand, in slow fading, the channel is assumed to 
be constant for each finite block length. 

In such problems, the channel state can be transmit- 
ted to the encoders either explicitly, or through output 
feedback. Typically the feedback is not perfect, that is 
the encoder has only partial information regarding the 
state or the output variables. The present paper studies 
a particular case, finite-state multiple-access channels 
(MACs), where partial channel state information (CSI) 
is provided to the encoders causally. What makes such 
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setup particularly interesting is the fact that the par- 
tial CSI available to the two transmitters is in general 
asymmetric, i.e., none of the transmitters' CSI contains 
the CSI available to the other one. On the other hand, 
we assume that the receiver has access to perfect state 
information. 

A single-letter characterization of the capacity region 
is provided for the case of independent and identically 
distributed (i.i.d.) channel state sequences. As we shall 
review shortly, results in the literature have already 
provided achievability results for such problems. The 
main contribution of this paper consists in providing a 
tight converse theorem. Our proof involves showing that 
restricting to encoders using only the quantized CSI on 
the current state does not cause any loss of optimality 
with respect to the most general class of admissible 
encoders using all the quantized CSI causally observed 
until a given time. 

The problem at hand can be thought of as a decen- 
tralized stochastic control problem. We shall elaborate 
on this connection in the concluding section, where we 
shall also discuss in what our arguments fail when trying 
to extend them to a proof of the converse theorem for 
finite-state MACs with memory, and asymmetric CSI at 
the transmitters. 

Let us now present a brief literature review. Capacity 
with partial channel state information at the transmitter 
is related to the problem of coding with unequal side 
information at the encoder and the decoder. The capacity 
of memoryless channels, with various cases of state 
information being available at neither, either or both the 
transmitter and receiver, has been studied in [ 1 3 1 and [T| . 
Reference [14] develops a stochastic control framework 
for the computation of the capacity of channels with 
memory and complete noiseless output feedback via the 
properties of the directed mutual information. Reference 
[:8:| considers fading channels with perfect channel state 
information at the transmitter, and shows that with in- 
stantaneous and perfect state information, the transmitter 
can adjust the data rates for each channel state to 
maximize the average transmission rate. Viswanathan 
|,17J relaxes this assumption of perfect instantaneous 
state information, and studies the capacity of Markovian 
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Fig. 1: Finite-state multiple-access channel with asym- 
metric partial state information at the transmitters. 



channels with delayed information. Reference ||4l studies 
the capacity of Markov channels with perfect causal 
state information. The capacity of Markovian, finite-state 
channels with quantized state information available at the 
transmitter is studied in ||20| . 

The works most closely related to ours are |6| and 
||9l . In the capacity of general finite-state MACs 
with different levels of causal CSI at the transmit- 
ters is characterized in terms of multi-letter formulas. 
Moreover, single-letter characterizations are provided for 
the capacity of finite-state MACs when the decoder 
has perfect CSI and the encoders are restricted to use 
only a finite window of, possibly limited, CSI; the 
capacity region without any such restriction is recov- 
ered in the limit of large window size. Reference [9J 
develops a general framework for approximating, and 
possibly characterizing, the capacity of channels with 
causal, and non-causal CSI: in particular. Theorem 4 
therein provides a single-letter characterization of the 
capacity region of a MAC with independent CSI at the 
transmitters. With respect to IS), (|9l, the present paper 
considers the somewhat simpler case of a MAC with 
i.i.d. state, where the encoders have causal, asymmetric, 
partial CSI, which is obtained through fixed quantizers 
acting componentwise. In contrast to IS), a single-letter 
expression for the capacity region is obtained in this 
case without any finite window restriction on the CSI 
available to the transmitters, while, differently from |j9l, 
the CSI available to the transmitters is not assumed to 
be independent. Recent related work also includes ifTTI . 
providing an infinite-dimensional characterization for 
the capacity region for Multiple Access Channels with 
feedback, and [3J, studying the case of MAC channels 
where the encoders have access to coded non-causal state 
information. 



The rest of the paper is organized as follows. In 
Section |ll] a formal statement of the problem and the 
main results are presented, consisting in a single-letter 
characterization of the capacity region of finite-state 
MACs with i.i.d. state. Section |lll] contains the proof 
of achievability of the capacity region, while Section 
HV] presents a proof of the converse coding theorem. 
Finally, in Section [V] we discuss the issues arising when 
trying to generalize our arguments to the memory case, 
and present some final remarks on the connections of 
this problem with the decentralized stochastic control 
literature. 

II. Capacity of i.i.d. finite-state MAC with 

ASYMMETRIC PARTIAL CSI 

In the following, we shall present some notation, 
before formally stating the problem. For a vector v, and 
a positive integer i, Vi will denote the i-th entry of v, 
while = {vi, . . . ,Vi) will denote the vector of the 
first i entries of v. Following a common convention, 
capital letters will be used to denote random variables 
(rv.s), and small letters denote particular realizations. 
We shall use the standard notation H( • ), and I( • ; • ) 
(respectively H( • | • ), and I( ■ ; • | • )) for the (conditional) 
entropy and mutual information of r.v.s. With a slight 
abuse of notation, for < x < 1, we shall write 
H(a;) for the entropy of x. For a finite set A, V{A) 
will denote the simplex of probability distributions over 
A. Finally, for a positive integer n, we shall denote 
by U(Ks<n °^ ^strings of length 

smaller than n. Q 

We shall consider a finite-state MAC with two trans- 
mitters, indexed by i e {a, 6}, and one receiver Trans- 
mitter i aims at reUably communicating a message Wi, 
uniformly distributed over some finite message set Wi, to 
the receiver. The two messages Wa and Wb are assumed 
to be mutually independent. We shall use the notation 
W := {Wa: Wb) for the vector of the two messages. 

The channel state process is modeled by a sequence 
S = {St : i = 1,2,...} of independent, identically 
distributed (i.i.d.) r.v.s, taking values in some finite- 
state space S, and independent from W; the probability 
distribution of any St is denoted by P( • ) G T-'iS)- 
The two encoders have access to causal, partial state 
information: at each time t > 1, encoder i observes 
Vt^^ = qi{St), where : 5 — > Vi is a quantizer mod- 
eling the imperfection in the state information. We shall 
denote by Vt := {v}°'\v}^'') the vector of quantized 
state observations, taking values in V :— Va ^ Vt- The 

(i) 

channel input of encoder i at time t, X}. , takes values 
in a finite set Xi, and is assumed to be a function of 

'This includes the empty string, conventionally assumed to be the 
only element of A''^ . 
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the locally available information {Wi, ^|]'^)- The symbol 

Xt = {xj:"^ , X( ) will be used for the vector of the two 
channel inputs at time t, taking values in X := Xa x Xi,. 
The channel output at time t, Yt, takes values in a finite 
set 3^; its conditional distribution satisfies 

P{Yt^y\W^w, X[t] ^X[t],S[t]^S[t])^P{yt\st,xt) 

(1) 

where, for any s G 5, and x ^ X, P{- \s,x) G P{y) 
is an output probability distribution. Finally, the de- 
coder is assumed to have access to perfect causal state 
information (which may be known causally or non- 
causally); the estimated message pair will be denoted 

We now present the class of transmission systems. 

Definition 1: For a rate pair R ~ {Ra,Rb), a block- 
length n > 1, and a target error probability e > 0, 
an (i?, n, e)-coding scheme consists of two sequences 
of functions 

{4'^ : X V| ^ X,}i<t<n , 
and a decoding function 

such that, for i G {a, b}, I < t < n: 
• |W,| > 2-^'"; 

. W^:=^(5[„],r[„]); 
. P{W ^W)<e. 

We now proceed with the characterization of the 
capacity region. 

Definition 2: A rate pair R — {Ra,Rb) is achievable 
if, for all e > 0, there exists, for some n > 1, an 
(i?, n, £)-coding scheme. The capacity region of the 
finite-state MAC is the closure of the set of all achievable 
rate pairs. 

We now introduce what we call memoryless stationary 
team policies and their associated rate regions. 

Definition 3: A memoryless stationary team policy is 
a family 

TT = {n,{ ■ \v,) G riX,)\i G {a, b}, G VJ (2) 

of probability distributions on the two channel input 
sets conditioned on the quantized observation of each 
transmitter For every memoryless stationary team pol- 
icy TT, TI{ti) will denote the region of all rate pairs 
R = {Ra,Rb) satisfying 

< i?a < l{Xa;Y\Xb,S) 

< Rb < l{Xb;Y\Xa,S) (3) 

< Ra + Rb < liX;Y\S), 

where S, X = {Xa,Xb), and Y, are r.v.s taking values 
in S, X, and y, respectively, and whose joint probability 



distribution 

i^{s, X, y) := P{S ^ s,X = x,Y ^ y) 
factorizes as 

iy{s,x,y) = P(s)'!ra{xa\qa{s))TTbixb\qbis))P{y\s,x) . 

(4) 

We can now state the main result of the paper. 
Theorem 4: The achievable rate region is given by 

CO (u^nn)) , 

the closure of the convex hull of the rate regions associ- 
ated to all possible memoryless stationary team policies 
TT as in (|2]|. 

In Section Hn] we shall prove the direct part of Theorem 
m namely that every rate pair R G co (U7r7?.(7r)) is 
achievable. In Section |IV] we shall prove the converse 
part, i.e. that no rate pair R G \ co (U^7?.(7r)) is 
achievable. 

III. ACHIEVABILITY OF THE CAPACITY REGION 

The result on achievability is known, and follows, 
e.g., from 1*61. For convenience, we briefly sketch a 
different approach, as suggested at the beginning of |j9] 
Sect. VI]. The main idea consists in considering an 
equivalent MAC having as input mappings form the CSI 
information available at the transmitters to the original 
MAC'S input. Specifically, one considers an equivalent 
memoryless MAC having output space Z := S x y 
coinciding with the product of the state and output space 
of the original MAC, input spaces Ui := {ui : V,: — > Xi}, 
for i G {a, b}, and transition probabilities 

Q{z\ua, Ub) P{s)P{y\ua{qa{s)),Ub{qb{s))) , 

where z — (s, y). Then, a standard arguments shows that 
the rate region 

Ra < l{Ua-Z\Ub) 

Rb < l{Ub;Z\Ua) 
Ra + Rb < 1{U;Z), (5) 

is achievable on this MAC, where U — {Ua, Ub) and Z 
are random variables whose joint distribution factorizes 
as 

P{Ua, Ub, Z) - |2aiUa)^ibiUb)QiZ\Ua, Ub) , (6) 

for some G 'Pii^a), and ^b G 'Pi^b)- Now, one can 
restrict himself to choosing probability distributions /i^ 
in V{Ui) ^ 'Pi^f') with the product structure 

^Jii{u^) = '^:^{u^{Vi)\vi) , Ui : ^ X^ , (7) 

where i G {a, b}, and vr is some memoryless stationary 
team policy, as in (|2]l. Then, to any triple of rv.s 
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{Ua, Ub, Z), with joint distribution as in (|6]l, one can nat- 
urally associate random variables S, Xa := Ua{qa{S)), 
Xb ■= Ub{qb{S)), and Y, whose joint probability distri- 
bution satisfies (|4|i. Moreover, it can be readily verified 
that 

l{Xa;Y\S,Xb) = l{Ua;Z\Ub) 
liXb;Y\S,Xa) = l{Ub:Z\Ua) (8) 
= 1{U;Z). 

Hence, if a rate pair R = {Ra,Rb) belongs to the rate 
region Tl{Tr) associated to some memoryless stationary 
team policy tt (i.e. if it satisfies (|3]l), then R satisfies 
(|5]i for the product probability distributions /i^, /if, de- 
fined by (|7|i- This in turn implies that the rate pair is 
achievable on the original finite-state MAC P. The proof 
of achievability of the capacity region co(U,r7?.(7r)) then 
follows from a standard time-sharing principle (see, e.g., 
[Sj Lemma 2.2, p. 272]). 

IV. Converse to the coding theorem 

In this section, we shall prove that no rate outside 
co(U,r7?.(7r)) is achievable. Lemma |5] shows that any 
achievable rate pair can be approximated by convex 
combinations of (conditional) mutual information terms. 
For £ G [0, 1], define 

^(e) log 13^1 + H£l, (9) 



and observe that 



lim ri{e) = . 

e— »0 



(10) 



For every t > 1, and a E ^, define 
1 



n ^ ' 



(11) 



Clearly, aa- > 0, and 



l<t<n creS*- 



(12) 

Lemma 5: For a rate pair R G M^, a block-length 
n > 1, and a constant e £ (0, 1/2), assume that there 
exists a (i?, n, e)-code. Then, 

Ra+Rb< J2 0^crliXt;Yt\St,S[t-l]=fT)+^{s) 

(13) 

Ra< (^<rliX^t"^;Yt\X^''\St,S[t-^=<r)+r^{e). 

(14) 

(15) 



Proof: By Fano's inequality we have the following 
estimate of the residual uncertainty on the messages 
given the full decoder's observation 

i/(M/|r[„];5[„]) < H(e)+elog(|Wa||>Vb|). 

(16) 

For 1 < i < n, we consider the conditional mutual 
information term 

At :=I(iy;yt,5t+i|y[t_i],5[t]), 

and observe that 

H(W/|5i)-H(T4^|5[„+i],y[„]) 



l<t<n 



= log(|>Va||Wfc|)-H(W^|5[„],Y[„]) 



(17) 

since the initial state 5*1 is independent of the message 
pair W, and the final state 5'„+i is conditionally indepen- 
dent of W given (S'[„] , F[„]). On the other hand, using 
the conditional independence of W from St+i given 
{S[t],Y[t]), one gets 

At = liW;Yt,St+i\Y[t_,],S[t]) 
= l{W;Yt\Y[t_,^,S[t]) 

< R{Yt\S[t])-il{Yt\W,S[t]) 
= l{W;Yt\S[t^), 

(18) 

where the above inequality follows from the fact 
that R{Yt\Y[t_^, S[t]) < il{Yt\S[t]), since removing 
the conditioning does not decrease the entropy, while 
H(ytH^,y[t_i],5[t]) = R{Yt\W,S^tf), as Yt is condi- 
tionally independent from ^[t-i] given {W,S[t]), due 
to the absence of output feedback. Since {W,S[t]) — 
{Xt,St) — Yt forms a Markov chain, the data processing 
inequality implies that 

liW;Yt\S[t]) <l{XuYt\S[t]). (19) 
By combining ( fT6]) . ( fTTb . ( fTSl ) and ( fT9] l, we then get 

Ra + Rb < -logdWallWbl) 



< 



1 — e n 



< - Y KXf,Yt\S[t])+vie). 

l<t<n 

Moreover, observe that 

l{Xt;Yt\S[t]) = Y nS[t-i] = (t)xc 



(20) 



n 



where x<t := l{Xt]Yt\StT S[t-i] — c). Substituting into 
yields (fTsl l. 
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Analogously, let us focus on encoder a: by Fano's 
inequality, we have that 

H(W„|r[„],5[„]) <H(£)+£log(|Wa|). (21) 

For t > 1, define 

A^'') liWa;Yt,St+,\Wf„ >^[t-i], 5[t]) , 
and observe that 
E A*"^' =H(M^,|5i,W^f,)-H(W^a|M^b,5[„+i],Y[„]) 

l<t<n 

>iog|>Va|-H(w^„|5[„],y[„]), 

(22) 

where the last inequality follows from the independence 
between Wa, Si, and Wb, and the fact that removing 
the conditioning does not decrease the entropy. Now, we 
have 

= l{Wa;Yt\Wb,Y[t_i],S[t]) 

= R{Yt\Wb,Y[t^^,S[t])~R{Yt\W,Y[t_i],S[t]) 

< RiYt\Wb,Sit])~ll{Yt\W,S[t]) 

= l{Wa;Yt\Wi„S[t]), 

(23) 

where the inequality above follows from the fact that 
liiYt\Wb,Y[t-i],Sit]) < RiYtlWb, S^t]) since removing 
the conditioning does not decrease the entropy, and that 
R(Yt\W,Y[t_^,S[t^) = R{Yt\W,S[t]) due to absence of 
output feedback. Observe that, since, conditioned on Wb 
and S[t] (hence, on X^''''), Wa — xj:"'^ — Yt forms a 
Markov chain, the data processing inequality implies that 

l{Wa;Yt\Wb,S[t]) < 

(24) 

By combining (ISTT l. (|22] |. (|23] l, and $24[ . one gets 

Ra < -log\Wa\ 

n 

^ ' l<t<n ^ ' 

<-Y. I(x('^);y,|x(^\5[,j)+r;(e) 

\<t<n 

= J2 c..l{X^t"^;Yt\xi''\St,S[t^i]=a)+7j{e), 

which proves ( fT4] l. 

In the same way, by reversing the roles of encoder a 
and b, one obtains ( fTSl l. ■ 

For t > 1, let us fix some string cr G 5*^^, and focus 
our attention on the conditional mutual information terms 
liXf, Yt\St, S[t-i]^a), l{X^'''>;Yt\X^''\ St, S[t-i]=(T), 
and llXt''^ ;Yt\Xt"'\St, S[t^i] — cr), appearing in the 
rightmost sides of (flJl l, (fT4l i. and (flSl l, respectively. 
Clearly, the three of these quantities depend only on the 



joint conditional distribution of current channel state St, 
input Xf, and output Yt, given the past state realization 
S[t-i] = Hence, the key step now consists in showing 
that 

Utris, X, y) := f{St = s,Xt^ x, Yt = 2/|5'[t_i] = cr) 

(25) 

factorizes as in This is proved in Lemma |6] below. 

For Xi G Xi, Vi £ Vi, and cr G 5*^^, let us consider 
the set T'^i.\x^,v,) C W,, 

T'^^\xi,Vi):^^Wi: <p[^\wi,qi{ai), . . . ,q{crt-i),Vi)^Xi^ 
and the probability distribution Tr^\ ■ \vi) G V{Xi), 

Lemma 6: For every 1 < t < n, cr E S*'^^, s G S, 
Xa G Xa, and Xb G Xb, 

uUs,x,y) = P{s)7rl-\xa\qa{s))7ri'Hxb\qb{s))P{y\s,x) 

(26) 

Proof: First, observe that 
Ua-is,x,y) = P{St = s\S[t-i] ^ (T)va-{x\s)P{y\s,x) 
= P{s)iya-ix\s)P{y\s,x) 

(27) 

where i/cr(a;|s) := P{Xt = x\S[t] = {cr, s)). The former 
of the equalities in dZTl ) follows from while the 
latter is implied by the assumption that the channel state 
sequence is i.i.d.. 

Now, recall that, for i G {a, b}, the current input 
satisfies ^ </>f ^ (W,, Vj[|^). For w e W, let 

:= FiXt = x\S[t] - (t, s), = w). Then, 

iy^{x\s) = V e»P(W^ = u;|5[t] = (cr,s)) 

= E \^a\-'\Wbr'^^, 

E \^^r' E iW"!" 

= 7ri°^(a;a|ga(s))7ri''^(a;b|q'b(s)) , 

(28) 

the second inequality above following from the mutual 
independence of S[t], Wa, and Wb. The claim now 
follows from ^ and ■ 

Let us now fix an achievable rate pair R = {Ra, Rb)- 
By choosing {R, n, e)-codes for arbitrarily small e > 0, 
the inequalities ( fT3]) . ( fT4b . and ( fTSl ). together with ( fTOl) 
and (fT2l) . imply that {Ra,Rb) can be approximated by 
convex combinations of rate pairs (indexed by cr G 5*^"^) 
satisfying ^ for joint state-input-output distributions as 
in ( |25] |. Hence, any achievable rate pair R belongs to 
co(U^7e(7r)). 
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Remark 1: For the validity of the arguments above, 
two critical steps were (l27T i. where the hypothesis of i.i.d. 
channel state sequence has been used, and ( l28T l, which 
only relies on the mutual independence of W and S[t], 
this being a consequence of the assumption of absence 
of inter-symbol interference. In particular, the key point 
in (|27] | is the fact that the past state realization cr appears 
in i^cr only and not in P (St). o 

Remark 2: For the validity of the arguments above, 
it is critical that the receiver observes the channel 
state. More in general, it would suffice that the state 
information available at the decoder contains the one 
available at the two transmitters. In this way, the decoder 
does not need to estimate the coding pohcies used in a 
decentralized time-sharing. o 

V. Extensions to channels with memory and 

CONCLUDING REMARKS 

The present paper has dealt with the problem of reli- 
able transmission over finite-state multiple-access chan- 
nels with asymmetric, partial channel state information 
at the encoders. A single-letter characterization of the 
capacity region has been provided in the special case 
when the channel state is a sequence of independent and 
identically distributed random variables. 

It is worth commenting to which extent the results 
above can be generalized to channels with memory. Let 
us consider the case when the channel state sequence 
{St : t = 1,2,...} forms a Markov chain with 
transition probabiUtiesP(5f+i — s+\St = s) = P(s+|s) 
which are stationary and satisfy the strongly mixing 
condition P(s+|s) > for all s,s+ £ S. Further, 
assume that there is no inter-symbol interference, i.e. 
{St t — 1,2, . . .} is independent from the message W, 
and that the state process is observed through quantized 
observations v/*"* = qi{St), as discussed earlier. 

For the generation of optimal policies in a multi- 
person optimization problem, whenever a dynamic pro- 
gramming recursion via the construction of a Markov 
Chain with a fixed state space is possible (see [19] 
for a review of information structures in decentralized 
control), the optimization problem is computationally 
feasible and the problem is said to be tractable. In a large 
class of decentralized control problems, however, one 
faces intractable optimization problems. Let us elaborate 
on this further 

In team decision problems, one may assume that 
there is an a priori agreement among the decentraUzed 
decision makers on who will do what, when the random 
variables take place. This approach is based on Witsen- 
hausen's equivalent model for discrete stochastic control 
lUSl, and makes the point that, indeed, all dynamic team 
problems are essentially static, with a much larger state 
space. 



In the case of finite-state multiple-access channels 
with independent and identically distributed state se- 
quences, by first showing that the past information is 
irrelevant, we observed that we could limit the memory 
space on which the dynamic optimization is performed. 
This is because, as observed in Remark 1, in the right- 
most side of (|27] | the past state realization cr affects only 
the control i/ct(x|s), but not the current state distribution 
P{St)- In contrast, when the state sequence is a Markov 
chain, the past state realization cr does affect both the 
control i^a-ixls) as well as the current state distribution 
P{St\S[t-i] = cr). It is exactly such a joint dependence 
which prevents the proof presented here to be general- 
ized to the Markov case. 

Let us have a brief discussion for the case where 
there is only one transmitter. In this case, the conditional 
probability distribution of the state given the observation 
history, nt{ ■ ) := P{St = ■ \V[t]) e ViS), can be shown 
to be a sufficient statistic, i.e. the optimal coding policy 
can be shown to depend on it only. As a consequence, 
the optimization problem is tractable. Such a setting was 
studied in |20|, where the following single-letter char- 
acterization was obtained for the capacity of finite-state 
single-user channels with quantized state observation at 
the transmitter and full state observation at the receiver: 

C:=/dP(7r) sup \y2l{X;Y\s,7T)P{s\7T)\ 
Jv{s) P(x\^)ev(x) y-^ J 

where P{s,tt) := P{s\tt)P{tt) denotes the asymptotic 
joint distribution of the state St and its estimate Ilf, 
existence and uniqueness of which are ensured by the 
strong mixing condition. 

For finite-state multiple-access channels with memory, 
a similar approach can successfully be undertaken only 
if the state observation is symmetric, namely if Qa = Qb- 
Indeed, in this case, the conditional state estimation 
nt(.) = FiSt = •l^if^) = nSt = -iV^t?) can 
be shown to be a sufficient statistic, and a single-letter 
characterization of the capacity region can be proved. 

However, for the general case when the channel state 
sequence has memory and the state observation is asym- 
metric (i.e. Qa / Qb), the construction of a Markov 
chain (which would not incur a loss in performance) 
is not straightforward. The conditional measure on the 
channel state is no longer a sufficient statistic: In par- 
ticular, if one adopts a team decision based approach, 
where there is a fictitious centralized decision maker, 
this decision maker should make decisions for all the 
possible memory realizations, that is the policy is to map 
the vai-iables {W, Vj^J"', V[||'^ ) to ^) decenti-ally, 

and the memory cannot be truncated, as every additional 
bit is essential in the construction of an equivalent 
Markov chain to which the Markov Decision Dynamic 
Program can be applied; both for the prediction on the 
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channel state as well as the belief of the coders on each 
other's memory. Let us also elaborate a discussion in 
view of common knowledge of Aumann |[T|: Information 
between two decision makers is common knowledge if it 
is measurable with respect to the sigma-fields generated 
by both of the local information variables at the decision 
makers. It is not usual in practical applications that all 
the local information is common knowledge. In such 
scenarios, one approach is to have the decision makers 
compute the conditional probability measures for the 
exogenous random variables and the actions of other 
decision makers for generating their optimal actions. For 
example, in the context of our problem in the paper, if 
we look for such person-by-person optimal policies, a 
policy of one of the encoders (say Encoder a) which 
uses the past will force the other encoder (Encoder b) to 
also use the past to second-guess the action of Encoder 
a, which requires the use of a policy with memory. 
Thus, adopting a person-by-person policy does not lead 
to useful structural results, in our context. 

We instead adopted Witsenhausen's equivalent model 
to generate team policies, as also elaborated in pT9), by 
having the encoders agree on which policies to adopt 
before random variables are realized. The approach in 
our paper showed that we can obtain a direct result when 
the channel state sequence is memoryless. However, 
when the channel state has memory, the past informa- 
tion provides useful information which is important for 
estimating the future channel states. As such, we cannot 
avoid the use of the information on the past channel 
state realizations. If one is to construct an equivalent 
state based on which coding policies are generated, 
the equivalent state needs to keep growing with time: 
The discussion in f6l provides such a block-level char- 
acterization and it seems we cannot go beyond this 
due to the non-tractability of the optimization problem. 
We note that if the encoders can exchange their past 
observations with a fixed delay, if they can exchange 
their observations periodically, or if they can exchange 
their beliefs at every time stage, then the optimization 
problem will be tractable. 

One question of important practical interest is the 
following: If the channel transitions form a Markov 
chain, which is mixing fast, is it sufficient to use a 
finite memory construction for practical purposes? This 
is currently being investigated. 
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